Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🎮 Reinforcement Learning
Q-Learning, Policy Gradients, Multi-Armed Bandits, Deep RL
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
113355
posts in
998.5
ms
Control Reinforcement Learning: Token-Level
Mechanistic
Analysis via Learned
SAE
Feature Steering
arxiv.org
·
23h
🗣️
LLMs
Rising Multi-Armed
Bandits
with Known
Horizons
arxiv.org
·
23h
♟️
Game Theory
check out this
article
on Reinforcement Learning with R:
Origins
, Real-Life Applications, and Practical Implementation
dev.to
·
2d
·
Discuss:
DEV
♟️
Game Theory
Show HN:
Fighting
the War Against
Expensive
Reinforcement Learning
cadenza-landing-qtu7gbjwb-akshparekh123-3457s-projects.vercel.app
·
21h
·
Discuss:
Hacker News
🗣️
LLMs
A
Conceptual
Framework for Exploration
Hacking
lesswrong.com
·
12h
♟️
Game Theory
Gibbs Measures from Deep Shaped
Multilayer
Perceptrons
link.aps.org
·
15h
🔥
PyTorch
Optimizing post-disaster road
restoration
with reinforcement learning: A
traveler-behavior-aware
approach
sciencedirect.com
·
12h
♟️
Game Theory
A training
principle
for
drifting
models
breno.bearblog.dev
·
17h
🤖
Machine Learning
AI Beyond The
Chatbot
: The New Value
Chain
seekingalpha.com
·
15h
🤖
AI
BetaZero
V2: A Diffusion Model for Setting
Boulder
Problems
evmojo37.substack.com
·
5h
·
Discuss:
Substack
🤖
Machine Learning
Owning
the AI
Pareto
Frontier
latent.space
·
6h
🤖
AI
Worlds
: A Simulation Engine for Agentic
Pentesting
dreadnode.io
·
5h
·
Discuss:
Hacker News
🤖
Machine Learning
A multi-agent reinforcement learning approach to autonomous aircraft
taxiing
with
taxiing
time, fuel consumption, and
emission
optimization
sciencedirect.com
·
1d
♟️
Game Theory
Multi AI Agent Systems with
crewAI
deeplearning.ai
·
17h
🤖
AI
The
Classifier
Layer: Spam, Safety, Intent, Trust Stand Between You And The Answer via @sejournal, @
DuaneForrester
searchenginejournal.com
·
14h
♟️
Game Theory
A “
Toolbox
”
Pipeline
for Robots That See, Read, and Act
hackernoon.com
·
4h
👁️
Computer Vision
Recursive
Language Models: Stop
Stuffing
the Context Window
nlp.elvissaravia.com
·
8h
🗣️
LLMs
My Honest And
Candid
Review of
Abacus
AI Deep Agent
kdnuggets.com
·
10h
🤖
Machine Learning
Optimal
timing
for
superintelligence
marginalrevolution.com
·
4h
🧮
Algorithms
Your AI Strategy Has a
Human-Shaped
Hole
superiortech.io
·
14h
·
Discuss:
Hacker News
🤖
AI
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help